The Estimation of Prediction Error: Covariance Penalties and Cross-Validation
نویسنده
چکیده
Having constructed a data-based estimation rule, perhaps a logistic regression or a classification tree, the statistician would like to know its performance as a predictor of future cases. There are two main theories concerning prediction error: (1) penalty methods such as Cp, Akaike’s information criterion, and Stein’s unbiased risk estimate that depend on the covariance between data points and their corresponding predictions; and (2) cross-validation and related nonparametric bootstrap techniques. This article concerns the connection between the two theories. A Rao–Blackwell type of relation is derived in which nonparametric methods such as cross-validation are seen to be randomized versions of their covariance penalty counterparts. The model-based penalty methods offer substantially better accuracy, assuming that the model is believable.
منابع مشابه
Assessing Prediction Error of Nonparametric Regression and Classification under Bregman Divergence
Prediction error is critical to assessing the performance of statistical methods and selecting statistical models. We propose the cross-validation and approximated cross-validation methods for estimating prediction error under a broad q-class of Bregman divergence for error measures which embeds nearly all of the commonly used loss functions in regression, classification procedures and machine ...
متن کاملAsymptotic analysis of covariance parameter estimation for Gaussian processes in the misspecified case
In parametric estimation of covariance function of Gaussian processes, it is often the case that the true covariance function does not belong to the parametric set used for estimation. This situation is called the misspecified case. In this case, it has been shown that, for irregular spatial sampling of observation points, Cross Validation can yield smaller prediction errors than Maximum Likeli...
متن کاملFrom Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation
In the field of statistical prediction, the tasks of model selection and model evaluation have received extensive treatment in the literature. Among the possible approaches for model selection and evaluation are those based on covariance penalties, which date back to at least 1960s, and are still widely used today. Most of the literature on this topic is based on what we call the “Fixed-X” assu...
متن کاملAsymptotic analysis of the role of spatial sampling for covariance parameter estimation of Gaussian processes
Covariance parameter estimation of Gaussian processes is analyzed in an asymptotic framework. The spatial sampling is a randomly perturbed regular grid and its deviation from the perfect regular grid is controlled by a single scalar regularity parameter. Consistency and asymptotic normality are proved for the Maximum Likelihood and Cross Validation estimators of the covariance parameters. The a...
متن کاملEvaluation of co-kriging different methods for rainfall estimation in arid region (Central Kavir basin in Iran)
Rainfall is considered a highly valuable climatologic resource, particularly in arid regions. As one of the primaryinputs that drive watershed dynamics, rainfall has been shown to be crucial for accurate distributed hydrologicmodeling. Precipitation is known only at certain locations; interpolation procedures are needed to predict this variablein other regions. In this study, the ordinary cokri...
متن کامل